1 Introduction

The intention of this data audit is to find any records from the Neotoma Paleoecology Database which potentially violate Neotoma’s statement of values, especially with respect to Neotoma’s goal of aligning with principles of Indigenous data sovereignty.

2 Are any sites in Neotoma located on federal Indigenous lands?

2.1 Method

We did a spatial join for every site in Neotoma with a unique site ID to shapefiles of the borders of federal Indigenous in the United States and Canada, and Indigenous protected areas in Australia, and we tallied and mapped all those which intersected the borders of federal reservations. See list below.

2.2 Next Steps

Our next steps are…

3 Are the coordinates for any sites in Neotoma fuzzed?

3.1 Methods

We checked for all sites whether both the latitude and longitude were exactly divisible by 0.25, or any sites where latitude or the longitude has a precision of 0 or 1 decimal places. If they were, we said they were fuzzed. Notice that this is a conservative method. There are likely fuzzed sites in Neotoma whose coordinates are not exactly divisible by 0.25, or that have precision greater than 1 decimal place. We found 3769 such fuzzed sites. The table below documents their siteids and names, and the map below documents their locations.

3.2 Next steps

our next steps…

4 Are any samples in Neotoma from humans?

4.1 Method

We downloaded Neotoma’s taxa table and selected any taxon IDs which might describe people.(Taxon ID 6359 is Primates, and 6171 is Mammalia.)

Then we used a Neotoma API to search for any occurrences of those taxon IDs.

The two maps below show the sites they come from, and the table documents what information there is about those samples from the samples table in Neotoma. Rows colored red are risk category 1 because they come from the United States. Rows colored orange are risk category 2 because they come from elsewhere.

## Warning: sf layer has inconsistent datum (+proj=longlat +datum=NAD83 +no_defs).
## Need '+proj=longlat +datum=WGS84'

4.2 Next steps

Next steps are….

5 Are any of Neotoma’s radiocarbon dates derived from humans?

5.1 Methods

We searched through two fields (notes and materialdated) from Neotoma’s geochronology table for any occurrences of words from the dictionary below.

Any rows from the geochronology table which contained one of the above words is listed in the table below. Notice that not all of these radiocarbon dates is necessarily problematic, only potentially. Further scrutiny may be needed. (We also checked against CARD’s list of radiocarbon dates deriving from human ancestors that are duplicated in Neotoma, and there was agreement between the two lists: all 60 of CARD’s records that are also in Neotoma are in the below table.)

We assigned risk categories as follows: any references to human bone were assigned risk level 1. Any references to human feces were assigned risk level 2. All other items were given risk level 3. It should be noted that references to bone collagen without taxon are ambiguous and may require reference back to original papers to determine taxonomic source. In particular, the bone collagen from a burial cairn may require further scrutiny.

5.2 Next steps

Actually scrutinizes these records.

6 Are any of the collection units for Neotoma’s records from culturally sensitive areas?

6.1 Methods

We used the same dictionary from the last query to search through two fields in Neotoma’s collection units table (location and notes). Any collection units that returned one of the above words is reproduced below.

We also took another approach, wherein we selected any depostional environment IDs that corresponded to archaeological sites (see table below), and filtered for all collection units that corresponded to any of those depositional environment IDs. This returned 1812 results.

6.2 Next steps

Need to actually scrutinize these records.